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Abstract 



, Consider a set P of N random points on the unit sphere of di- 

mension d — 1, and the symmetrized set 5 = FU (— P). The halving 
polyhedron of S 1 is defined as the convex hull of the set of centroids of 
N distinct points in S. We prove that after appropriate rescaling this 
>««✓ , halving polyhedron is Hausdorff close to the unit ball with high prob- 

ed ' ability, as soon as the number of points grows like Q(dlog(d)). From 

this result, we deduce probabilistic lower bounds on the complexity of 
approximations of the distance to the empirical measure on the point 
set by distance-like functions. 



> 
00 

'. 1 Introduction 

m 

■ The notion of distance to a measure was introduced in order to extend ex- 

isting geometric and topological inference results from the usual Hausdorff 
sampling condition to a more probabilistic model of noise [CCSM11]. Con- 
sider a finite subset P of the Euclidean space M. d and a positive number k in 
the range {1, . . . , \P\}, where |P| denotes the cardinality of P. The distance 
^ \ to the empirical measure on P is given by the following formula: 




min > \\x — Pi 

1=1 



1/2 

(1.1) 



where the minimum is taken over the sets consisting of k distinct points in P. 
We will call this function the k-distance to the point set P. Equation (1.1) 
allows to compute the value of the fc-distance at a certain point x easily, 
using a nearest-neighbor data structure. On the other hand, this formula 
cannot be used to perform more global computations, such as estimating the 
Betti numbers of a sublevel set dp. (0,r) = {x E ~R d ; dp±(x) r}, not to 
mention reconstructing a simplicial complex homotopic to this set. 

There is another representation of the fc-distance that allows to perform 
such global operations. It is computational geometry folklore that dp^ can 
be rewritten as the square root the minimum of a finite number of quadratic 
functions. More precisely dp^(x) 2 = min^ \\x — p\\ 2 +Wp, where the minimum 
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is taken over the set of centroids of k distinct points in P, and Wp is chosen 
adequately (see §2.2). This implies that sublevel sets of dp t k are simply 
union of balls, and this allows one to compute their homotopy type using 
weighted alpha-complexes or similar constructions [Ede92]. However, since 
the number of centroids of k distinct points in P grows exponentially with 
the number of points, this formulation is not very practical either. 

Fortunately, many geometric and topological inference result continue 
to hold if one replaces dp^ in the computation by a "good approximation" 
ip [CCSM11]. This means that the error \\ip — dp^W^ ■= max R( j \ip — dp t k\ 
has to be small enough, and that the approximating function p should be 
distance-like. For the purpose of this work, ip is distance-like if there exists 
a finite set of sites Q C M. d and non- negative weights (w q ) q( zQ such that 
ip = ip~Q, where ipg is defined by 

<Pq(x) := i min \\x - q\\ 2 + w q J , w q ^ 0. (1.2) 

To summarize, in order to estimate the topology of the sublevel set of the 
/c-distance, it makes sense to try to replace it by a distance-like function that 
uses much fewer sites. 

Note that one could try to find approximations of the /c-distance in a class 
of functions T different from the class of distance-like functions. It is indeed 
possible that a well chosen class of functions would produce more compact 
approximations of the /c-distance and of similar functions. However, changing 
the class of function would practically forbid to use these approximations for 
the purpose of geometric inference, because of the lack of (i) computational 
topology tools to compute with the sublevel sets of functions in T and (ii) a 
geometric inference theory adapted to this class of function. 



Complexity of /c-distance The natural formalization of our approxima- 
tion problem is as follows. Given a finite point set P in R d , a number k > 
and a target approximation error e, what is the minimum cardinality of a 
weighted point set (Q,w) with non-negative weights such that the approxi- 
mation error ipp — dp^ is bounded by e ? We call this cardinality the e- 

complexity of the /c-distance function dp : k- When e is zero, the 0-complexity 
of the /c-distance function dp^ is equal to the number of order- k Voronoi cell 
of P that have non-empty interior. One can then translate lower-bounds on 
the number of order-A; Voronoi cells into lower bounds for the 0-complexity 
of the /c-distance. The purpose of this article is to provide lower bounds on 
e-complexity of the fc-distance function for a non-zero approximation error 
e, when P is a random point cloud on the unit {d — l)-dimensional sphere 
and k = \P\ /2. 
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1.1 Prior work 

Approximation of the /c-distance The question of approximating the 
distance to the measure by a distance-like function with few sites has been 
originally raised in [GMM12]. In this article, the authors proposed an ap- 
proximation of the /c-distance, called the witnessed k-distance and denoted 
by dp fc , which involves only a linear number of sites. They also give a prob- 
abilistic upper bound on the approximation error ||dp. — dpfc||oo under the 
hypothesis that the point cloud P is obtained by sampling a ^-dimensional 
submanifold of the Euclidean space. The upper bound on the approxima- 
tion error degrade as the intrinsic dimension I of the underlying submanifold 
increases. This suggests that approximating the distance to the uniform 
measure on a point cloud drawn from a high-dimensional submanifold might 
be difficult. 

It is possible to build data structures that allow to compute approximate 
pointwise values of the /c-distance in time that is logarithmic in the number 
of points — but exponential in the ambient dimension [HPK12]. The same 
data structure can be used to compute generalizations of the /c-distance, such 
as the sum of the pth power to the /c-nearest neighbors for an exponent p 
larger than one. 

Complexity of order-/c Voronoi As mentioned earlier, there exists upper 
and lower bounds for the number of cells in an order-A; Voronoi diagrams, and 
those bounds can be translated into bounds on the number of sites that one 
needs to use in order to get an exact representation of the /c-distance function 
by a distance-like function. When k is half the cardinality of the point cloud 
P, we will speak of halving Voronoi diagram. A halving hyperplane for P 
is a hyperplane that separates P into two sets with equal cardinality. The 
number of halving hyperplanes yielding different partitions of P is a lower 
bound on the number of infinite halving Voronoi cells. The best lower bound 
on the number of halving hyperplanes, that holds for an arbitrary ambient 
dimension d, is given by N d ~ 1 e n ( y/ ^ W \ where N is the cardinality in the 
point set [TotOl]. This bound improves on previous lower bounds by several 
authors, e.g. [ELSS73, EW85, Sei87]. 

In [BS94], the authors study the expectation of the number of fc-sets of a 
point cloud P obtained by sampling independent random point on the sphere. 
Recall that a fc-set is a subset of k points in P that can be separated from 
other points in P by a hyperplane; in particular, each k-set corresponds to an 
infinite order-/c Voronoi cell. The authors prove that the expected number 
of /c-sets in a random point cloud P on the sphere is upper bounded by 
0(|P| d-1 ). 

In order to obtain our lower bounds on the number of sites needed to 
approximate dp^, we will use the notion of k-set polyhedron, originally in- 
troduced in [EVW97]. This polyhedron is the convex hull of the set of 
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centroids of k distinct points in P. The relation between this polyhedron 
and the notion of fc-set is that the number of extreme points of the fc-set 
polyhedron is equal to the number of fc-sets in the point set. When k is half 
the cardinality of P, we will call this polyhedron the halving polyhedron. 

1.2 Contributions 

The main result of this work concerns the geometry of halving polyhedra of 
finite point sets on the unit sphere. Our theorem shows that even with rela- 
tively few points, the halving polyhedron of a certain random point set S on 
the unit (d— l)-dimensional sphere is Hausdorff-close to a ball with high prob- 
ability. The random point set S is obtained by picking N random, indepen- 
dent and uniformly distributed points pi, . . . ,pn on the (d — l)-dimensional 
unit sphere, and by letting S = {±pi}i<^i^iy- The halving polyhedron of S is 
a by definition a random convex polyhedron, which we denote by L d N . The 
statement and proof of this theorem are inspired by the main theorem of 
[AAFM06]. We use the quantity 



where A is a uniformly distributed random vector on the unit (d — l)-sphere, 
and u is an arbitrary unit vector. Note that this quantity turns out not to 
depend on u. 

THEOREM 3.1. There exists an absolute constants c > such that for every 
positive number rj, the inequality 



In Section 4, we deduce from this theorem a probabilistic lower bound 
on the e-complexity of the ds n, where S is the point set defined in the 
previous paragraph. The exact statement of the lower bound can be found 
in Theorem 4.1. 

2 Traces at infinity and k-set polyhedra 

Background: support function The support function of a convex subset 
K of W 1 is a function x i-> h(A, x) from the unit sphere to R. It is defined 
by the following formula: 





holds with probability at least 1 — 2exp [c • (dlog(l/<5) — At/ 2 )] , where 5 
min(r/, 1/y/d). 



h(A, u) := max {x • u; x £ K} . 



(2.3) 
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The application that maps a convex set to its support function on the sphere 
satisfies the following isometry property: 

||h(^,.)-h(L,.)||^ d _ 1= d H (K,L). (2.4) 

where H/H^^d-i = max^d-i |/| is the infinity norm on the unit sphere, 
and where djj denotes the Hausdorff distance. The proof of this equality is 
given in Theorem 1.8.11 in [Sch93]. All the elementary facts about support 
functions that we will need can be found in the first chapter of this book. 



2.1 Traces at infinity of distance-like functions. 

We call distance-like a non-negative function whose square can be written 
as the minimum of a family of unit paraboloids \\x — q\\ +w q , with w q ^ 0. 
Note that this definition is equivalent to the one given in [CCSM11], thanks 
to the remark following Proposition 3.1 in this article. 

Given a finite subset Q of the Euclidean space and non- negative weights w, 
we let <p := pq be the distance-like function defined by (1.2). We call trace 
at infinity of p and denote by K{p) the convex polyhedra obtained by taking 
the convex hull of the set of sites Q used to define p. 

The name trace at infinity is explained by the following asymptotic de- 
velopment of the values of p, along a unit-speed ray starting at the origin, 
i.e. 7 t := tu with \\u\\ = 1: 



(p(jt) = min ( 1 1 tit - q\\ 2 + w q ) 



\V2 



qeQ 

. 2 w a + 

mint 1 u ■ q + 



2 
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qeQ \ t t 2 



= t- maxii ■ q + 0(l/t) (2.5) 
= t-h(K{<p),u) + 0(l/t) (2.6) 

The last equality follows from the fact that the support function of the convex 
hull of a set Q is given by max^gg u ■ q. Using these computations, we get 
the following lemma. This lemma implies in particular that if two functions 
tp := tpq and tp := pf, coincide on M. d , then K(ip) = K(ip). 

Lemma 2.1. Given two distance-like functions p := pg and tp := p v P as in 
(1.2), 

dH(^),K(^)^||^-Vlloo- (2-7) 



Proof. The asymptotic developments for p and tp given in equation (2.6) im- 
ply the following inequality for every unit direction u: \h(K(p),u) — h(K(ip),u)\ ^ 
\\p — V'lloo- With Equation (2.4), this yields the desired bound on the Haus- 
dorff distance between K{p) and K{%p). □ 
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The power diagram of a weighted point set (Q, w) is a decomposition of 
the space into convex polyhedra, one per point in Q, defined by 

Pow q(9) = {x £ M d ; Vp € Q, \\x - q\\ 2 +w q ^ \\x - p\\ 2 + w p }. 

The following lemma shows the relation between the vertices of the trace at 
infinity of (^q and unbounded cells in the power diagram of (Q, w). 

LEMMA 2.2. Consider a finite weighted point set (Q,w): 

(i) if the power cell of a point q in Q is unbounded, then q lies on the 
boundary of the polyhedron K((Pq); 

(i) conversely, if q is an extreme point in K((Pq), then the power cell of q 
is unbounded. 

Note that there might exist point q that lie on the boundary of the 
trace at infinity and whose power cell is empty. For instance, consider Q = 
{q~i, qo, qi} in R 2 with qi = (0, i), and weights W-i = w\ = and Wo > 1. 
Then qo lies on the boundary of the convex hull while its power cell is empty. 

Proof. Being convex, the power cell of a point q is unbounded if and only it 
contains a ray := r + t u > where u is a unit vector. By definition of the 
power cell, we have \\q — 7 t || 2 + w q ^ ||p — 74 1| 2 + u> p for all point p in the set 
Q. Expanding both sides expressions and simplifying, we get: 

\\q — r\\ 2 — \\p — r\\ 2 + 2tu • (p — q) + w q — w p ^ 

This inequality holds for t — > +00, thus implying u • p ^ u • q. Thus, q lies 
on the boundary of K((p), and u is an exterior normal vector to K(ip) at q. 

Conversely, if q is an extreme point of the convex polyhedron K(<p), there 
must exist a unit vector u, such that u ■ (p — q) ^ —e < for any point p in 
Q distinct from q. Tracing back the above inequalities, and using the strict 
bound, one can show that for any point r, the ray 74 := r + tu belongs to 
the power cell of q for t large enough. □ 



2.2 Trace at infinity of the /c-distance. 

Given a set of points P in M. d and an integer k between one and \P\, the 
k-distance to P is defined by equation (1.1). The fact that this function 
is distance-like can be seen in the following equivalent formulation, whose 
proof can be found for instance in Proposition 3.1 of [GMM12]: 

( k \ 1 ' 2 
dp,k(x) =1 m_in ^ \\x - p\\ 2 + Wp\ , (2.8) 

where the minimum is taken over the centroids p of k distinct points in P, i.e. 
V=\ J2i<:i^kPi and wnere the weight is given by Wp := \ Za^fc \\P ~ Pi\\ 2 ■ 
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Definition 2.1. We denote K%(P) the convex hull of the set of centroids of 
k distinct points in P. The support function of this polyhedron is given by 
the following formula: 

1 k 

h(K d (P),u):= max -J>- K (2.9) 

p lt ...,p k eP k f 
i=i 

This polyhedron has been introduced first in [EVW97] under the name of 
k-set polyhedron of the point set P. Equations (2.8)— (2.9) above imply that 
this polyhedron is the trace at infinity of the A;-distance to P. Moreover, the 
number of infinite ovdev-k Voronoi of P is at least equal to the number of 
extreme points in K%(P). 

Halving distance When the number of points in P is even and k is equal 
to half this number, we rename the fc-distance the halving distance. Similarly, 
we will refer to the oider-k Voronoi diagram as the halving Voronoi diagram 
and to the k-set polyhedron as the halving polyhedron. 

3 Approximating the sphere by halving polyhedra 

In this section, we consider a family of random polyhedra constructed as 
halving polyhedron of symmetric point sets on the unit sphere. More pre- 
cisely, we define: 

Definition 3.1. Given a set P of N points on the unit (d — l)-sphere, we 
define Lfj(P) as the halving polyhedron of the symmetrization of P, i.e. 

L d N (P) :=K d N (PU(-P)) 

We obtain a (random) convex polyhedron := Lfj(P), by letting P be an 
independent random sampling of N points on the unit (d — l)-sphere. 

Our main theorem consists in a lower bound on the probability of the 
halving polyhedron Lfj to be Hausdorff-close to a ball centered at the origin. 

THEOREM 3.1. There exists an absolute constants c > such that for every 
positive number rj, the inequality 

d H (—L d N ,B(0,l)) <tj 

holds with probability at least 1 — 2exp [c • (dlog(l/<5) — iVr/ 2 )] , where 5 = 
min(?7, 1/y/d). 

The proof of this theorem is postponed to Section 5. As a first corollary, 
one can show that the random polyhedron L d N is approximately round with 
high probability as soon as the cardinality of the set of points sampled on 
the sphere grows faster than dlogd, where d is the ambient dimension. 
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Corollary 3.2. For any k > 0, there is a constant C K such that for 77 > 0, 
d ^ 1/rj 2 and N ^ (log ((f) + C K ) the following inequality 

d H (—L d N ,B(0,l)) ^n 
holds with probability at least 1 — exp(— nd). 

Proof. From Theorem 3.1 the probability bound in the statement holds if 
— nd ^ c(d\og(l/5) — Nrj 2 ). Since 5 is equal to l/Vd, this is the case if 
Nr? >d(log(d) + ^). □ 



4 Application: approximation of distance-to-measures 

The results of the previous section can be used to obtain a probabilistic 
statement on the complexity of the halving distance to a random point set 
on a high-dimensional sphere. We call e-complexity of a distance-like function 
ip : M. d — >■ M the minimum number of sites that one needs in order to be able 
to construct a distance-like function ip such that the infinity norm \\ip — V'lloo 
is at most e, i.e. 

A%,£) :=min{|Q|; \\<p - ^ e, <p% as in (1.2)}. 

The following theorem provides a probabilistic lower bound on the e-complexity 
of a family of distance-like functions. 

THEOREM 4.1. For any constant k > there exists a constant C(n) such 
that the following hold. Let 7] > 0, d ^ l/n 2 , and S be the symmetrization 
of an random point cloud of cardinality N = ^-(log(d) + C K ) on the unit 
sphere. Then, the inequality 

d-l 

N (ds,N, m dV ) ^ 2\fd ( — — j 
V64d(log(d) + C K )J 

holds with probability at least 1 — exp(— nd). 

Taking k = 1 and d = l/n 2 , this implies: 

COROLLARY 4.2. There exists a sequence of point clouds of cardinality 
d 2 (log(d) + C\) on the sphere S^ 1 such that 
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Proof of Theorem J^.l. The halving polyhedron Lfj(S) is by definition equal 
to the trace at infinity of the halving distance K(ds,N)- Therefore, we can 
apply Corollary 3.2: there exist a constant C K such that for a random point 
set S distributed as in the statement of the theorem, the following inequality 
holds: 

d H (—K(d s , N ), B(0, 1)) < V . (4.10) 

with probability at least 1 — exp(— nd). We now consider a deterministic 
point set S on the sphere that satisfies the above inequality, and we con- 

1 1 2 

sider a distance- like function min g6 Q \\x — q\\ + w q such that the 

approximation error — dg jv||oo is bounded by m d r). By definition of the 
complexity of ds t N, our goal is to prove a lower bound on the cardinality of 
the point set Q. Using the triangular inequality for the Hausdorff distance 
first, and then Lemma 2.1.(ii), we get the following inequalities: 

d H (—K(^),B(0,1] 

< — d K (Kty), K(d s , N )) + d H (—K(d s , N ),B(Q, 1) 
m d \m d 

< IIV 7 - ds,Ar||oo + V < 277 

Now, recall that K{ip) is equal to the convex hull of the point set Q. If 
we define R as the rescaled set {m^q; q 6 Q}, the above inequality reads 
d H (conv(i?),B(0,l)) < 2r). 

We have thus constructed a polyhedron that is within Hausdorff dis- 
tance 1r\ of the ci-dimensional unit ball. The last remark of [BI75] gives the 
following lower bound on the number of vertices of such a polyhedron: 

\Q\ = \R\ ^ 2Vd(8»/)-^ 

To conclude the proof, we simply replace rj by its expression in term of the 
number of points N and the dimension d, i.e. if := d(\og(d) + C K )/N to 
obtain 

d-l 

5 Proof of the main theorem 

We start this section by showing a simple expression for the support function 
of the halving polyhedra of a symmetric point set. For a point set P, and 
N := \P\, one has: 

h(L d N (P),u) = J2\P-u\- (5-H) 

PGP 
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Proofof Equation (5.11). Let S = P U { — P} and recall that by definition, 
the support function h(L^(P),u) is equal to max n • pi, where the 
maximum is taken on sets of N distinct points in S. Choosing e(p) = ±1 
such that for every point in p in P, e(p)u • p ^ 0, one easily sees that this 
maximum is attained for {pi, . . . ,Pn} = {s(p)p;p € P}. □ 

This computation motivates the following definition. 

Definition 5.1. We let Xf u be the measure on [0, 1] given by the distribution 
of \u • X\ where X is a uniformly distributed random vector on the {d — 1)- 
dimensional unit sphere, and u is a fixed unit vector. This measure turns 
out not to depend on u, so we will denote it by Xf. 

The random polyhedron Lfj is defined as Lfj(P) where the point set P 
is obtained by drawing N independent points on the unit sphere. Equa- 
tion (5.11) implies that for any unit vector u the distribution of values of 
h(Lfj,u) is given by the formula: 

i=i 

where the (Yj) are iV independent random variables with distribution Xf. 

Lemma 5.1. The measure Xf has the following properties: 

(i) Xf is absolutely continuous with respect to the Lebesgue measure, with 
density 

f d (t) := Cd (l-t 2 )^ 
the constant c d being chosen so that f d is the density of a probability 

rl / 9\ d ~ 2 

measure, i.e. Cd J (1 — t ) 2 at = 1. 

(ii) the mean of Xf is given by := ^. Moreover, is equivalent to 
y / 2/(7rd) as d — > 00 . 

(Hi) The variance o\ of Xf is equivalent to (1 — 2/ir)/d as d — > 00. 

Proof, (i) Using Pythagoras theorem, one checks that the intersection of the 
hyperplane {x G M. d ; u ■ x = i) with the unit sphere is a (d — 2)-dimensional 
sphere with squared radius (1 — t 2 ). This implies the formula for f d , for a 
certain constant c&. To compute this constant one uses the fact that X\ has 
unit mass, i.e. 

c d [ (l-t 2 )^dt = l. 
Jo 

Note that a formula of Wallis asserts that 

lim Vd [\l-t 2 )l = 0^2. 

d ^°° Jo 
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This implies that c d ~ y / 2d/n. 

(ii) The function 1 1- > tfd(t) admits an explicit primitive: 

Ut) :=-^(l-t 2 )l, 

so that rrid = gaiX) — 9d{fy is as i n the statement of the lemma. Thus, we 
get m d ~ ^2/{'Kd). 

(hi) Integrating the following inequality between and 1 



t\l-t 2 ) L 



:i-t 2 )(i-t 2 )^, 



one gets the following formula for the second moment of \ d : 



Cd 



l\ 2 {l-t 2 )- 
Jo 



-dt = 1 



Cd 

cd+2' 



Moreover, an integration by part gives 



Cd 



JO 



t 2 (l-t 2 )^dt = c ^ / {l-t 2 )idt= 1 -^ 

a Jo dc d+2 



These two equalities imply that c d /c d+ 2 = d/(d+ 1), and using the formula 
for the mean given above, we get: 



2 1 cj 1-2/vr 

(J , = — — rsj 

d d d 2 d 



□ 



Lemma 5.2. There exists a universal constant c > such that for any di- 
mension d, any N > and any set of directions U in S d ~ l , one has 

F (max h(L d N ,u) - m d \ ^ 7]m d ) < 2 \U\ exp {-c- Nrf) . 

Proof. Consider N random variables Y\, . . . , Yn with distribution Xf. These 
random variable are bounded by 1 and their variance is cr 2 ,. Applying Bern- 
stein's inequality gives: 



1 N 

-TYi- 
N 4-f 



m d 



i=l 



^ e < 2 exp 



-Ne 2 



This implies that for a fixed direction u, 



( h(L^,u) - m d 



^ ^ 2 exp 



2a 2 + 2e/i) 



2a 2 + 2rim d /3j 
Rescaling everything by a certain constant k > 0, one gets 



> V m d 



h(nL%,u) - Km d ^ nr/m^ 

< 2exp f -^«Vmg \ 
^ 2eXP V2K 2 a 2 + 2 W /3y' 
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Letting n go to infinity, we obtain the following bound 

P (|h(L&,u) - m d \ ^ nma) s$ 2exp ^-^N V 2 ^j 

This gives the desired estimate for a single direction using the two estimates 
rrid = 0(d~ 1 / 2 ) and o\ = 0(d _1 ). The conclusion of the lemma is obtained 
by a simple application of the union bound. □ 

LEMMA 5.3. If K is contained in the ball B(0, r), the support function h(K, .) 
is r-Lipschitz. 

Proof. Consider u in the unit sphere, and x in K such that h.(K, u) = u • x. 
For any vector v in the unit sphere, 

h.(K, v) = max v-y^v-x = u- x+(v — u) ■ x 

^ h(K, u) — \\u — v\\ \\x\\ 
^ h.(K, u) — r \\u — v\\ . 

Swapping u and v gives the Lipschitz bound. □ 

A subset U of the unit sphere is called a 5 -sample or a (5-covering 

if the union of the Euclidean balls of radius 5 centered at points of U cover 
the unit sphere. 

LEMMA 5.4. Consider a convex set K contained in the unit ball £?(0, 1), and 
two numbers A, rj € (0, 1). Moreover, suppose 

max \h(K, u) - A| ^ r/A, (5.13) 

where U is a 5-sample of the unit sphere, with 5 := min(A,?7). Then, the 
Hausdorff distance between jK and the unit ball is at most 5rj. 

Proof. Note that, almost by definition, a convex set K is included in the ball 
£>(0, r) if and only if its support function satisfies ||h(K, ^ r. Assuming 
that the convex set K is contained in some ball 13(0, r), we have: 

\\h(K,.)-X\\ OD = max \h(K,v) - X\ 



^ max 

veS 11 - 1 



min \h(K, u) - h(K, v)\ + \h(K, u) - A| 



^ r min(A, rf) + 7]X. (5-14) 

The first inequality is obtained by applying the triangle inequality, while 
the second one follows from the Lipschitz estimation of Lemma 5.3, the fact 
that U is a 5-sample of the unit sphere and from Equation 5.13. Applying 
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Inequality (5.14) with r = 1, rj < 1 and using the triangle inequality we 
get ||h(if, .) — A|| ^ 2A. This implies that K is contained in the sphere 
13(0, 3A) and allows us to apply the same inequality (5.14) again with the 
smaller radius r = 3A, implying 

||h(K,.)-A|| 00 ^3Ar ? + 2Ar ? = 5Ar ? . 

Dividing this last inequality by A, and using Equation (2.4) implies the 
conclusion of the Lemma. □ 

Proof of Theorem 3.1 Consider 5 = min(?7, md), and let U be a (5-sample 
of the unit sphere with minimal cardinality. The cardinality of such a sample 
is bounded by \U\ ^ (const/5) d , where the constant is absolute. Applying 
Lemma 5.4 first and then Lemma 5.2 gives us the following inequalities: 



1 



d H —Kff, 13(0, 1) >5j/UP max h(L%, u) - m d 
m d J J \ueu 

^ 2|[/|exp(-c- Nrf 2 ) . 



> rrm d 



We conclude the proof by applying the upper bound on the cardinality of 
the 5-sample U stated above, and by using the equivalent m d ~ y / 2/(ird). 



6 Extension to other values of 

It is possible to extend some of the results to cases where ratio k/N is 
different from one half. Let us consider the random polytope Mfj k = Kf(P), 
where P is a set of N random points sampled uniformly and independently 
on the (d— l)-dimensional unit sphere, and k is between 1 and N. Lemma 5.2 
can be partially extended to this more general family of random polytopes. 
Note however that the statement below does not provide any estimate for 
the radius r(d,N,k) as the number of points N grows to infinity with k/N 
remaining constant. In particular, it is not precise enough to generalize the 
lower bounds of Theorem 3.1. 

Lemma 6.1. For any dimension d, and any numbers N and k, there exist 
a value r := r(d,N,k) such that for any set of directions U in S^ 1 , and 
N > one has 



( 



max 



mr{,,.u) - i > ,/r) sC 2 \U\vxp[-~ V 2 r 2 



This lemma follows from a version of Chernoff's inequality adapted to 
Lipschitz functions. Consider a function F from the cube [—1,1]^ to R, 
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which is a-Lipschitz with respect to the i 1 norm on the cube. Then, for any 
family Y\, . . . , Y/v of i.i.d. random variables taking values in [—1, 1] one has: 

F(\F(Y 1 ,...,Y N )-EF\ >e) ^ 2ex P (^f^) . (6.15) 

Proofof Lemma 6.1. We consider the map Ffc from the cube [—1, 1]^ to R 
defined by: 



F&(x) := max I \^x ij ; 1 < h < . . . < i k ^ N 



where Xi denotes the ith coordinate of x. We also consider the measure fx 
obtained by pushing forward the [d — l)-area measure on the unit sphere of 
R d by the map x i— y u ■ x, for some direction u in the unit sphere. As in 
the halving case (cf (5.12)), the distribution of h(M^ fc , u) is given by the 
distribution of Ffc(Yi, . . . , Y]v), where Yi, . . . Yjv are i.i.d random variables 
with distribution fj,. 

Moreover, the map F^ is ^-Lipschitz with respect to the i 1 norm on the 
cube [—1, 1]^. Indeed, given two points x,y in [—1, 1]^ and a set of indices 
%\ < . . . < if- corresponding to the maximum in the definition of F^(x), one 
has 

kF^(x) = x h + . . . + x ik ^y il + ...+y ik + \\x - y\\ e i 

< kF&(y) + \\x - y\\ el 

Thus, we can apply Chernoff's inequality (6.15): 

s 2 k 2 ' 



(\Fb(Y u ...,Y N )--EF&\ >e) < 2exp 



4A 



where m = k/N. The Lemma follows by setting r := EFJy-, e = rjr in the 
equation and by using the union bound. □ 
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